首页> 外文OA文献 >Silent error detection in numerical time-stepping schemes
【2h】

Silent error detection in numerical time-stepping schemes

机译:数值时间步进方案中的无声错误检测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Errors due to hardware or low level software problems, if detected, can befixed by various schemes, such as recomputation from a checkpoint. Silenterrors are errors in application state that have escaped low-level errordetection. At extreme scale, where machines can perform astronomically manyoperations per second, silent errors threaten the validity of computed results. We propose a new paradigm for detecting silent errors at the applicationlevel. Our central idea is to frequently compare computed values to thoseprovided by a cheap checking computation, and to build error detectors based onthe difference between the two output sequences. Numerical analysis provides uswith usable checking computations for the solution of initial-value problems inODEs and PDEs, arguably the most common problems in computational science.Here, we provide, optimize, and test methods based on Runge-Kutta and linearmultistep methods for ODEs, and on implicit and explicit finite differenceschemes for PDEs. We take the heat equation and Navier-Stokes equations asexamples. In tests with artificially injected errors, this approach effectivelydetects almost all meaningful errors, without significant slowdown.
机译:如果检测到由硬件或低级软件问题引起的错误,则可以通过各种方案来修复该错误,例如从检查点重新计算。静默错误是应用程序状态中的错误,它们已摆脱低级错误检测。在极端规模下,机器每秒可以执行许多天文运算,而无声错误威胁着计算结果的有效性。我们提出了一种用于在应用程序级别检测静默错误的新范例。我们的中心思想是经常将计算值与廉价检查计算所提供的值进行比较,并根据两个输出序列之间的差异来构建错误检测器。数值分析为ODE和PDE的初值问题的解决提供了可用的检查计算,这可能是计算科学中最常见的问题。在此,我们提供基于Runge-Kutta和ODE的线性多步方法的方法,优化方法和测试方法,关于PDE的隐式和显式有限差分方案。我们以热方程和Navier-Stokes方程为例。在具有人工注入错误的测试中,此方法可有效检测几乎所有有意义的错误,而不会显着降低速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号